Stochastic volatility of financial markets as the fluctuating rate of trading: 

an empirical study 
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We present an empirical study of the subordination hypothesis for a stochastic time series of a 
stock price. The fluctuating rate of trading is identified with the stochastic variance of the stock 
price, as in the continuous-time random walk (CTRW) framework. The probability distribution of 
the stock price changes (log-returns) for a given number of trades N is found to be approximately 
Gaussian. The probability distribution of A'^ for a given time interval At is non-Poissonian and has 
an exponential tail for large A'^ and a sharp cutoff for small N. Combining these two distributions 
produces a nontrivial distribution of log-returns for a given time interval At, which has exponential 
tails and a Gaussian central part, in agreement with empirical observations. 

PACS numbers: 89.65.Gh, 05.40.Fb, 
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Introduction: stochastic volatility, subordination, and 
fluctuations in the number of trades. The stock price St 
is a stochastic series in time t. It is commonly character- 
ized by the probability distribution PAt{x) of detrended 
log-returns x = lii{St2/ St-^) ~ M^^: where the time inter- 
val At = t2 — ti is called the time lag or time horizon, and 
/i is the average grovifth rate. For a simple multipUcative 
(geometric) random walk, the probability distribution is 
Gaussian: PAt{x) oc exp{—x'^/2vAt), where v — a'^ is 
the variance, and a is the volatihty. However, the empir- 
ically observed probabiUty distribution of log-returns is 
not Gaussian. It is well known that the distribution has 
power-law tails for large x [Ij, [3- However, the distribu- 
tion is also non-Gaussian for small and moderate x, where 
it follows the tent-shaped exponential (also called double- 
exponential) Laplace law: PAt{x) oc exp(— cjccj/V At), as 
emphasized in Ref. Q . The exponential distribution was 
found by many researchers [4, 5, 6, 7, 8, 9, 10, 11], so it 
should be treated as a ubiquitous stylized fact for finan- 
cial markets Q. 

In order to explain the non-Gaussian character of the 
distribution of returns, models with stochastic volatility 
were proposed in literature l3, 13, 14, 1^. If the vari- 
ance vt changes in time, then vAt in the Gaussian dis- 
tribution should be replaced by the integrated variance 
VAt = Vt dt. If the variance is stochastic, then we 
should average over the probability distribution QAtiV) 
of the integrated variance V for the time interval At: 



PAt{x) 



dV- 



-x'^/2V 



■QAtiV). 



(1) 
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The representation ^ is called the subordination [Til . 

In this approach, the non-Gaussian character of 
PAtix) results from a non-trivial distribution QAt{V). 

In the models with stochastic volatility, the variables v 
or V are treated as hidden stochastic variables. One may 
try to identify these phcnomcnological variables with 
some empirically observable and measurable components 
of the financial data. It was argued 13, 3 23| that the 



integrated variance Va* may correspond to the number 
of trades (transactions) NAt during the time interval At: 
VAt — £,NAt, where ^ is a coefficient [21]. Every transac- 
tion may change the price up or down, so the probability 
distribution Pn{x) after N trades would be Gaussian: 



PNix) 



Then, the subordinated representation ([T|) becomes 



PAtix) 



dN , ^ 

V2<7V 



KAtiN), 



(2) 



(3) 



where KAtiN) is the probability to have N trades dur- 
ing the time interval At. (We assume that N is large 
and use integration, rather than summartion, over N.) 
In this approach, the stochastic variance v reflects the 
fluctuating rate of trading in the market. 

Performing the Fourier transform of ([3|) with respect 
to X, we find that the characteristic function PAtikx) 
is directly related to the Laplace transform KAti^N) of 
KAtiN) with respect to N, where kx and kN are the 
Fourier and Laplace variables conjugated to x and N: 



PAtikx) 



dNe-^^f^i/^KAtiN) = KAti^kl/2). (4) 



In this paper, we study whether the subordinated rep- 
resentation ([31) agrees with financial data. First, we ver- 
ify whether Pwix) is Gaussian, as suggested by Eq. ([2]). 
Second, we check whether empirical data satisfy Eq. ([4]). 
Third, we obtain KAtiN) empirically and, finally, discuss 
whether PAtix) constructed from Eq. ^ agrees with the 
data. Refs. |1B, i20i] have already presented evidence in 
favor of the first conjecture; however, the other questions 
were not studied systematically in literature. 

The subordination was also studied in physics lit- 
erature as the continuous-time random walk (CTRW) 
[H [23. Refs. di, [H, m focused on the probabihty 
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FIG. 1: The variance of log-returns {x^)n after trades 
plotted vs. A''. 



distribution of the waiting time At between two conse- 
qutive transactions (AA^ — Our approach is to study 
the distribution function K^t{N), which gives comple- 
mentary information and can be examined for a wide va- 
riety of time lags. In Ref. [i^l, this function was studied 
for some Russian stocks. 

We use the TAQ database from NYSE [28|], which 
records every transaction in the market (tick-by-tick 
data). We focus on the Intel stock (INTC), because it is 
highly traded, with the average number of transactions 
per day about 2.5 x lO"*. Here we present the data for 
the period 1 January - 31 December 1999, but we found 
similar results for 1997 as well [1^. Because of difficul- 
ties in dealing with overnight price changes, we limit our 
consideration to the intraday data. Since At is relatively 
short here, the term fiAt is small and can be neglected. 
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FIG. 2: The variance of log-returns {x^)At (upper points, left 
scale) and the average number of trades {N)At (lower points, 
right scale) vs. the time lag At. The solid lines of slope 1 
represent the proportionality relations ((S}. 
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FIG. 3: Cumulative distribution Cn{x / gm) of normalized log- 
returns after N trades, where = {x'^)n, compared with the 
Gaussian distribution (dashed curve). N/ri is the typical time 
interval between A'^ trades. 

Probability distribution of log-returns x after N trades. 
It follows from Eq. ^ that {x^)n — , where {x^)n is 
the second moment of x after N trades. It is also natural 
to expect that the average number of trades (iV)At during 
the time interval At is proportional to At with some 
coefficient rj. Thus, we expect 

{x^)m = ^N, (TV) At = 77At, {x^)At = 0At, e - C»7- (5) 

Notice that the coefficient 9 = (v) is the mean variance. 
Figs. [T] and [5] show that the relations (O are indeed sat- 
isfied. We extract the values of the coefficients from the 
slopes of these plots: ^ = 2.4 x 10~® per one trade, 
7y = 3.8 X 10'^ trades/hour, and 6* = 9.5 x 10~^ per hour. 
The relation 6* = ^77 is satisfied only approximately, but 
within the measurement accuracy. 

In Figs. [3]and|4l we examine the empirical probability 
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FIG. 4: The parametric Q-Q plot of the empirical cumula- 
tive distribution C]v(a;/crjv) of normalized log-returns vs. the 
Gaussian distribution for the parameter x from —00 to -|-cx3. 
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FIG. 5: The parametric plot of the Fourier transform PAt{kx) 
vs. the Laplace transform K/^t{(,k'i/2) verifying Eq. @ for the 
parameter kx from — oo to +oo. 
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FIG. 6: Empirical cumulative distribution CAt{N) for the 
number of trades during the time interval At, plotted in 
the log-linear scale. The solid lines illustrate exponential tails. 



distribution Pn{x) of log-returns x after N trades. In 
Fig. [21 the cumulative distribution functions Cat (a;) — 
J^^dx'PN{x') for a; < and 1 - CnIx) for a; < 
are compared with the Gaussian distribution shown by 
the dashed line. The log-return x is normalized by 
CTjv = \/ {x"^) N- The empirical distributions Pn{x) for 
different N agree with the Gaussian in the central part, 
but there are deviations in the tails, as expected for large 
Similar results were found in Fig. 6 of Ref. [3Q |. 

Fig. 3] shows the Q-Q plot similar to the one con- 
structed in Ref. [l^. This is a parametric plot, where 
the vertical axis shows the empirical CAr(a;/ajv), and the 
horizontal axis shows the cumulative Gaussian distribu- 
tion of x/a, whereas the parameter x changes from —00 
to -l-oo. The plots for different N are all close to the di- 
agonal, which indicates agreement between the empirical 
and the Gaussian distribution functions. Fig. |4] empha- 
sizes the central part of the distribution, whereas Fig. [3] 
emphasizes the tails. Overall, we conclude the empirical 
distribution PNix) is reasonably close to the Gaussian in 
the central part, so Eq. Q is approximately satisfied. 

When the time lag approaches one day, the number of 
data points become too small to construct reliable prob- 
ability densities, so we cannot verify the Gaussian hy- 
pothesis beyond the intraday data. When the time lag 
is too short, and the corresponding N is small, the log- 
returns are discrete and cannot be described by a con- 
tinuous function, such as Gaussian. We found that the 
distribution of x becomes reasonably smooth only after a 
thousand of trades [2^ . Discreteness of the distribution 
for small N can be seen in Fig. 11 of Ref. 31 1. 

The characteristic function for log-returns and the 
Laplace transform for the number of trades. The sub- 
ordination hypothesis ([3]) can be examined further by 
checking the relation Q between the Fourier trans- 
form PAt{kx) for log-returns and the Laplace trans- 



form K/^t{kN) for the number of trades. These func- 
tions can be directly constructed from the data. As 
shown in Ref. Q, PAt(fcx) « (V") E"=i e*''""'^ and 
K\t{kN) ~ (l/'Ti) where the sums are 

taken over all occurrences of the log-returns Xj and the 
numbers of trades Ni during a time interval At in a 
dataset. Because the frequency of appearances of a given 
Xj or Ni is proportional to the corresponding probabil- 
ity density, these sums approximate the integral defini- 
tions pAt{kx) = f^^ dxe^''^^PAt{x) and KAt{kN) = 
J^dNe-^-^KAtiN). 

In Fig. m we show the parametric plot of pAtikx) vs. 
KAt{£.kl/2). The vertical axis shows PAt{kx), and the 
horizontal axis shows K At{£,k'^ / 2) , whereas the parame- 
ter kx changes from from —00 to +00. The upper right 
corner (1, 1) corresponds to kx — 0, and the lower left 
corner (0,0) corresponds to large |fc^|. The parameter ^ 
used in Fig.l^lis extracted from the slope of {x"^) n vs. N 
in Fig. [TJ The relations 



d^PAtikx) 



dkl 



= -{x^)At, 



dKAt(kN) 



dkN 



= -{N) 



At 



(6) 

and Eq. ([5]) ensure that the slope of the parametric plot 
near the point (1,1) corresponds to the diagonal. Overall, 
the plots for different At in Fig. [5] are close to the diago- 
nal, but deviate in the lower corner for large \kx\, which 
indicates that the subordination relation ^ is satisfied 
only approximately. Notice that no assumptions about 
the functional form of Kai (N) are made in Eq. (|4]) . The 
only assumption is that PNix) is Gaussian ([2|), and the 
distributions of x and N are uncorrelated, so they can be 
combined in Eq. 

Probability distribution of the number of trades N dur- 
ing the time interval At. Fig. [S] shows the log- linear 
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([3]). Substituting these expressions into Eq. ([3]), we get 
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FIG. 7; Probability density PAt{x/a/\t) of normalized log- 
returns after the time lag At, where — {x^)At- The solid 
lines are fits to the Heston model with I/7 — 50 min. The 
curves are offset vertically by the factors of 10. 



plot of the empirically constructed cumulative distribu- 
tion CAtiN) = KAt{N') dN' for the number of trades 
N during the time interval At. The straight lines are 
eye guides, which indicate that the probability distribu- 
tions KAt{N) are exponential for large N. The slopes 
of the lines are related to (iV)At — rjAt, so we can ap- 
proximate KAt{N) oc exp(— A^/yyAt) for large N. For 
small N, Fig. El shows that CAt{N) is flat and KAtiN) 
is suppressed, so that KAt{N = 0) = 0. It is indeed 
very improbable to have no trades at all for an extended 
time period At. For long enough At, we expect that 
KAt{N) would become a Gaussian function of N cen- 
tered at {N)At — TjAt. However, this regime has not 
been achieved yet for the time lags At shown in Fig. [6l 
For short At, we also found that {N'^)At oc {N)^^ with a 
coefficient somewhat smaller than 2, as expected for an 
approximately exponential distribution. 

Notice that the exponential behavior of the empiri- 
cal KAt{N) shown in Fig. [6] is inconsistent with the 
Poisson distribution K^f''^°''{N) = e-''^* {r]At)^ /N\ ex- 
pected for trades occurring randomly and independently 
at the average rate rj. It was suggested in literature 
that KAtiN) may be approximated by the log-normal 
or gamma distributions. We do not attempt to discrimi- 
nate between the alternative hypotheses here, but some- 
times these functions may look alike [s^ . A qualitatively 
similar distribution KAt{N) was found for some Russian 
stocks in Ref. [27]. 

Probability distribution of log-returns x after the time 
interval At. Having established that Pn{x) is approxi- 
mately Gaussian and KAtiN) is approximately ex- 
ponential for short At, we can obtain PAtix) from Eq. 



P^tix) 



dN 



-\x\^/TlBKi 



V'2<]V TyAt 



^mKt 



(7) 

Eq. ([7]) shows that the exponential distribution of the 
number of trades iV results in the exponential (Laplace) 
distribution of log-returns x. This can be understood as 
follows. The integral ([7|) can be taken exactly, but one 
can also evaluate it approximately by integrating around 
the optimal value of N^ = \x\y^riAt/2^ that minimizes 
the negative expression in the exponent of Eq. ([7]) and 
maximizes the integrand. We see that the probability to 
have a given log-return x is controlled by the probability 
to have the optimal number of trades N^ix). Thus, the 
distribution PAti^) has the fatter (exponential) tails than 
Gaussian, because the probability to have a large x is 
enhanced by fluctuations with large N. 

On the other hand, for very small x, the optimal value 
TV* becomes limited by the cutoff in KAtiN) for small N. 
At this point, the optimal value N^ stops depending on 
X, so PAtix) becomes Gaussian. Thus, we expect to see 
the Gaussian behavior in PAtix) for small \x\ and the 
exponential behavior for medium and large \x\. Fig. [7| 
shows a log-linear plot of the empirical probability den- 
sity PAtix). In agreement with the qualitative analysis 
presented above, we observe that the data points follow 
the parabolic (Gaussian) curve for small |x| and fall on 
the straight (exponential) lines for large |a;|. The range 
of X occupied by the Gaussian expands when the time lag 
At increases, because the cutoff in KAtiN) for small N 
increases with the increase of At, as shown in Fig. [6l We 
conclude that the subordination hypothesis ^ is qualita- 
tively valid, and, particularly, it explains the exponential 
distribution PAtix) for result of the exponential 

distribution KAtiN) for the number of trades N. 

The solid lines in Fig. [7] show fits of the data to the 
Heston model. The Heston model 1J| is a model with 
stochastic volatility, which has the advantage of being 
exactly solvable. A closed- form solution for PAti^) was 
obtained in Ref. [s^l , and Fig. [7] shows fits of the data 
to the formula derived there. Refs. [3, [s^l pointed out 
that PAtix) in the Heston model has the exponential 
tails and Gaussian center, in qualitative and quantitative 
agreement with the empirical distribution of log-returns. 
Given the verification of the subordination hypothesis 
presented in this paper, one may ask whether the Heston 
model describes the probability distribution KAtiN) for 
the number of trades N. A detailed study of this question 
will be presented in a separate paper [3j| . 

We also would like to point out that Eq. ([7]) represents 
a special case of the variance- gamma distribution intro- 
duced by Madan and Seneta [35[. The Heston model 
solution [3^ reduces to the variance-gamma distribution 
in the limit of short At, see Eqs. (48) and (49) in Ref. 
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